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Neural Network Prediction of 
Aluminum-Lithium Weld Strengths 
From Acoustic Emission Amplitude Data 


by Eric v. K. Hill,* Peggy L Israel/ and Gregory L. Knotts* 


Abstroct 

Acoustic emission (AE) flow growth activity was monitored in 
aluminum -lithium weld specimens from the onset of tensile loading to 
'ai'ure. Data on actual ultimate strentghs together with AE data from the 
beginning of loading up to 25 percent of the expected ultimate strength 
were used to train a backpropagation neural network to predict ultimate 
strengths. Architecturally , the fully interconnected network consisted of 
an input layer for the AE amplitude data, a hidden layer 1c accommodate 
1 ailure mechanism mapping , and an output layer for ultimate strength 
prediction. The trained network was then applied to the prediction of 
uli imaiestraigthsih the remaming six specimens. The worst case prediction 
error was found to be + 2.6 percent . - 

K eyzvo rdsracoustic emission ,aluminum-l ithi um ,ampli tude d is tribu tion, 
rlqcrcpagation, failure mechanism, neural network , ultimate strength. 

INTRODUCTION 

The 2195-T87 aluminum-lithium alloy is being considered as a 
replacement materia] for the 2219-T87 aluminum currently in use 
on the Space Shuttle External Tank. Both materials exhibit good 
weldability and strength, but the aluminum-lithium is less dense 
and 25 percent stronger, thereby providing extra payload capacity; 
hence, the incentive for change. Because variable polarity plasma 
arc welding is the principal method of joining, and the welds are 
typically the weakest link -(Nunes et al, 1984), weld strength is 
paramount to external tank structural integrity. A method is devel- 
oped here for predicting ultimate weld strengths at proof loads as 
low as 25 percent of the expected ultimate using acoustic emission 
( AE) data. 

The AE data taken during proof loading have been correlated 
with ultimate strengths in both composites (Kalloo, 1 9SS; Hill, 1 992; 
and Walker, 1992) and in metals (Hill and Knotts, 1993). These 
correlations used such AE parametric data as amplitude, energy, 
and event rate to quantify the AE signals generated by the various 
i ailure mechanisms. Thus, AE data contain inform a tion concerning 
failure mechanisms which can be correlated to ultima te strengths in 
engineering materials. 

The different characteristics of the .AE signals correspond to 
different deformation and failure mechanisms during loading. For 
example, rubbing noises (from specimen grips) and plastic defor- 
mation generate AE signals that are similar to one another but 
ripically very much different from the AE signals produced by 
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crack growth or intermetallic precipitate fractures (McBride et al., 
19S1). SLnce each of these deformation and failure mechanisms 
contributes in varying degrees to the structural integrity of a 
finsihed part, in generating an ultimate strength prediction equa- 
tion, the AE associated with each mechanism must be weighted 
differently (have a different coefficient). The .AE events associated 
with rubbing and plastic deformation, which contribute tittle or 
nothing to the ultimate strength, w*ould have weighting functions 
(coefficients) approaching zero, while those that contribute signifi- 
cantly, such as crack grow’th or intermetallic precipitate fractures, 
would have large coefficients. Neural networks provide an auto- 
mated technique for sorting out the AE associated with the various 
m echanisms and a etermining the a ppropria t e coeffi ri en ts or wei gh t- 
ing functions (Sachse and Grabec, 1992). Therefore, neural net- 
works w f ere employed in this research to generate ultima te strength 
predictions from the AE amplitude data. 

NEURAL NETWORK TRAINING 

Acoustic Emission Amplitude Distributions 

Tne acoustic emission amplitude parameter, A [cB], is a 
logrithmic representation of the peak signal voltage, V |V], of the 
AE waveform 

(1) A = 20 log(V/VJ 

For most applications, V, = 1 pV at the sensor output is chosen 
as the 0 dB reference because it is the 3ow>est detectable voltage, just 
slightly above the noise level of the system electronics. Here the AE 
sensor contained a built-in 40 dB preamplifier; consequently, 0 dB 
w*as referenced to 1 OOpV at the preamplified sensor output (Mitchell, 
1984). 

Acoustic emission amplitude distributions (events vs. ampli- 
tude histograms) have been shown to contain information that 
allow 1 the identification of failure mechanisms in materials (Pollock, 
1 981 ). Tne (differential) amplitude distribution can represent peak 
signal voltages of the AE waveforms ranging from 1 00 pV (0 dB) to 
10 V (100 dB). The various failure mechanisms are typically seen 
grouped together as characteristic humps or bands in the ampli- 
tude distribution, and while the amplitude bands for such mecha- 
nisms as plastic deformation and crack growth are widely sepa- 
rated, there are other mechanisms w’hose characteristic amplitude 
bands overlap. This overlap’ in the AE failure mechanism ampli- 
tude bands is accentuated by attenuation effects, especially disper- 
sion (Miller and Mclntire, 1987). Here, because the specimens were 
small, the attenuation effects in the AE waveforms w^ere expected 
to be minimal. It was therefore hoped that the amplitude distribu- 
tions w’ould have enough separation in the failure mechanism 
bands to allow accurate prediction of ultimate strengths in the 
aluminum-lithium weld specimens. 

The amplitude distribution for specimen 01-5 is showm in Fig- 
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ure 1 Note that there are five distinct humps or AE amplitude 
(failure mechanism) bands and 307 total AE events Because the 
logarithms of measurements tend to have normal distributions 
(Tennant-Smith, 1985), the various failure mechaiusm humps in the 
amplitude distribution were approximated as such. This being the 
case, the first mechanism probably has amplitudes between 4-12 
dB; the second mechanism ranges from approximately 8-20 dB; the 
third from 19-29 dB; and the fourth from 34-35 dB. The fifth hump 
is a single event at 91 dB representing specimen failure. It can be 
seen that there is some overlap between the first and second and the 
second and third mechanisms; moreover, inasmuch as the first 
hump does not appear to be normally distributed, there may well 
be more than one mechanism buried within it (Kouvarakos, 1992). 
Modeling these humps and determining the effect of the various 
mechanisms is where neural networks come into play. 



Figure 1— Differential amplitude distribution for weld specimens 01-5. 


Neural Networks 

The beautv of neural networks lies in the fact that they can take 
into account the effects of the various failure mechanisms without 
explicitly identifying and/or isolating them from the rest of the 
data. Moreover, given the appropriate structure and input, neural 
networks can learn all the pertinent data interactions and make 
accurate predictions using the entire data set, regardless of extrane- 
ous data (such as AE from rubbing noises) or data overlap and 
without having to resort to multivariate statistical analysis. These 
brain-like responses are generated through the use of artificial 

neurons. . . . 

An artificial neuron in its simplest form is shown in Figure 2(a). 
Here the Wi are the weight vector components, and the x x are the 
input vector components. Within the artificial neuron each input 
vector component is multiplied by its respective weight vector 
component; then these products are summed up over all the Lnpu ts 
to yield the NET output 

(2) NET = li w,xj 

Such a neuron can be used to model linear processes or provide a 
linear mapping from the input to the output. 

In order to accommodate nonlinear mappings (solve nonlinear 
problems), a nonlinear activa tion function, F, must be applied to the 
NET output as shown in Figure 2(b). The nonlinear activation 
function employed here is the sigmoid (meaning S-shaped) or 
squashing function: 



As the value of NET ar>proaches large negative values, OUT 
approaches a limiting value of -0.5; when NET is equal to 0, OUT is 
equal to 0; and as NET grows large in a positive sense, OUT 
approaches a value of t 0.5. Figure 3 is a plot of this function. In 
addition to allowing the solution of nonlinear problems, the nonlin- 
ear °ain provided by the sigmoid activation function allows the 
same neuron to process both very large and very small inputs 
without noise saturation problems (Wasserman, 1989). This two 
laver network with multiple inputs and a nonlinear activation 
function is knowm as a perception. 

Many problems cannot be represented by a single nonlinear 
mapping. For such problems, multiple or nested nonlinear map- 
pings can be obtained by cascading layers of perceptions together 
to form multilayer networks. These consist of an input layer, an 
output layer, and one or more middle or hidden layers. It should be 
mentioned that multilayer networks provide no increase in compu- 
tational power over two layer networks unless nonlinear activation 
functions are included within each layer of neurons (Wasserman, 
19S9). 



Figure 3 — Sigmoid activation function. 
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From Figure 4 it can be seen that the amplitude histogram, 1(A), 
:s an approximation to the actual amplitude distribution, cf.4) 

■ Zurada, 1992). Each rectangle in the histogram (Figure 5a) can be 
modeled by a three layer network consisting of two perceptrons in 
parallel followed by a single neuron to sum their outputs (Figure 
. Binary activation functions in the perceptrons are used to create 
square comers of the rectangle: one perceptron to map the up- 
see and one to map the down-side. This binary approximation to 
the amplitude histogram is designated EfvXjAh where u’,) denotes 
the interconnection weight to neuron i from neuron;. When the 
binary activation functions are replaced by continuous activation 
functions, such as the sigmoid, the result becomes a smooth lire 
approximation, e(zvij, A), to the actual amplitude distribution, e(A), 
creating sigmoidal humps instead of rectangles (Figure 5a). Hence, 
this three layer network provides a nonlinear mapping followed by 
a linear mapping. More commonly, multilayer networks employ 
nonlinear activation functions throughout, meaning that each neu- 
ron layer produces a nonlinear mapping. Such is the case in this 
application. 


elA) 

EiA) 



Figure 4 — Amplitude histogram. 

Training is accomplished by adjusting the interconnection 
weights such that known input amplitude data produce known 
ultimate strengths as outputs. This means that inputting the ampli- 
tude distribution for specimen 0T5 should yield its ultima te strength 
of 51 .5 ksi (0. 355 kPa) as the output. The known input (amplitude 
distribution) and target output (ultimate strength) constitute w’hat 
is called a training pair. When both the input and the output are 
known, the training is designated supervised learning. Typically, 
several training pairs (specimens) are needed to train the network 
and arrive at the appropriate weight components. Once the net- 
work is trained, a set of test data is used to verify the prediction 
accuracy of the neural network The test data set, like the training 
set, consists of known inputs with known outputs. Thus, a predic- 
tion error can be calculated and network performance assessed. 

Bockpropogation 

Backward error propagation or backpropagation is a training 
method that compares the actual output of the network with the 
expected or target output, then backpropa gates adjustments in the 
weights proportional to the calculated error. The object of training 
is to adjust the weights such that the application of a set of inputs 
produces the desired or target output Hence, network training is 
two step procedure — forward and backward — propagating the 
inputs and their concomitant activations forward to the output 
layer, then propagating the error backward from the output layer 
t h rou gh the hidden layeris) to upgrade the interconnection weights. 

For the ANSim software package used herein, a bias neuron is 
included in every layer except the output layer (Figure 6). Each bias 



Figure 5 — (a) Amplitude histogram rectangle and (b) the 
neural network necessary to model it. 

neuron has a constant activation value of + 0.5 and is connected to 
all neurons in the succeeding layer (Dayhoff , 1 990) . The biases, like 
the interconnection weights, are learned during backpropagation. 
Thus, each forward pass through the hidden and output layer 
neurons implements the following activation equations (ANSim, 
1988): 

(4) NETi = 9j + Ij wipUTj 
and 

(5) OUTi = [1/(1 + e- NTT5 )l- 0.5 

where 0j is thebias for neuron i, and Wj } is the interconnection w'eight 
to neujon i from neuron;. 

The bias neuron provides a constant threshold term 0i in the 
weighted sum of the neurons in the succeeding layer w’hich trans- 
lates their sigmoid functions to the left or right depending upon its 
sign (Figure 7). This bias term results in an improvement on the 
convergence properties of the network during the training phase by 
keeping the sigmoids operating at their midrange values. 



Figure 6 — A typical three-layer network with bias neurons. 


1042 Materials EvatualfcyVSaplembef 1993 





3iases and weight adjustments in the various layers are calcu- 
lated using the generalized delta rule. This is a steepest descent 
method of computing the interconnection weights that minimizes 
the total squared output error over a set of training vectors (pairs). 
Each pass through the training set is called a cycle. After each 
training cycle, t, the following equation is used to calculate the total 
normalized root mean square (RMS) error, 5*, for output neuron i: 

(6) & = l[IpS,CTARGEti - OUTjfi/Pl) 17 * 

Here, p is the training pair or pattern, P is the total number of 
training pairs or patterns, i is the output neuron, 1 is the total number 
of output neurons, and TARGET = target output for neuron i. 

The weight matrix changes are then backpropagated through 
each layer; within the network using the equation 

(7) w jj(t+l) = q&OUTp + a w »j{t) 

where q is the learning rate, a is the momentum, &;OUTj is the 
current weight change, and w i} <t) is the previous weight change. 

In theory, gradient descent is guaranteed only if infinitesimal 
changes are made to the weights. Because this would make the 
training process unacceptably long, a settable learning rate, q, is 
introduced. The goal is to set the learning rate as high as possible 
without causing the RMS error to oscillate significantly. This term 
is typically set between 0.01 and 1 .0, depending upon the difficulty 
of the problem. The momentum term, a, is used to increase the 
learning rate without making the RMS error osdUate. The momen- 
tum determines what portion of the previous weight changes will 
be added to the current weight changes. This term is usually set to 
0.9. In summary, by applying the generalized delta rule for 
backpropagation training, a multilayer neural network can learn to 
develop whatever features are necessary to perform the desired 
mapping from the input pattern to the target output. 



Figure 7 — Sigmoid translation provided by threshold term 0 


EXPERIMENTAL PROCEDURE 

The eleven 5/16 by 1 by 12 in. (0.79 by 2.54 by30.4$ cm) tensile 
test specimens were cut from butt welded sheets. Both sheets were 
made of 2195-T87 aluminum-lithium with a 231 9 filler. In each case 
the weld was in the center of the specimen and perpendicular to the 
direction of pull. 

A 10 kip (45360 kg)/min linear load ramp was generated by a 
computer-controlled Tinius Olsen tensile test machine. The load 
cell output ranged from 0.00 to 1.00 V (0.00 to 36.3 kips [16465.68 
kg]). A Physical Acoustics Corporation (PAC) 3000/3004 acoustic 
emission analyzer and R15I piezoelectric sensor (with integral 
preamplifier) were used to collect the AE data. The AE sensor was 
taped to the specimen next to the weld and acoustically coupled 
using Sonotrace 40 ultrasonic couplant. The PAC 3000 / 3004 thresh- 
old was set at 0.3 V with a total system gain of 81 dB. This high 
system gain and low threshold were necessary to sense the pre- 
dominantly ductile failure mechanisms inherent in the aluminum- 
lithium alloy. 

In order to develop the technique, all eleven specimens were 


taken to failure with the .AE sensor attached. These data were to be 
used to train the neural network to predict ultimate strengths from 
the AE data collected up to 25% of the expected ultimate strength, 
which would then become the proof load. The value for the 
expected failure strength was obtained by taking the average of the 
ultimate strengths for the training set. Once the network had been 
trained, the procedure w’ould be to remove the sensor after the 
applying the proof load, then have the network make its ultimate 
strength prediction from the AE data taken up to that point. 

DATA INPUT AND NETWORK ARCHITECTURE 

The objective of this work was to predict the ultimate strength of 
a weld specimen from its AE amplitude data at proof loads w-ell 
below yield, such that no macroscopic damage is done to the 
specimen. On viewing the amplitude distributions for all the w T eld 
specimens, it was found that the AE events having amplitudes 
greater than 50 dB were all associated with ultimate specimen 
failure (Figure 1). Such events only occur at high loads and therefore 
are not appropriate input for the prediction of ultimate strengths 
from the AE data taken at lozo loads. Consequently, only the event 
data in the amplitude range from 1 to 50 dB were input to the 
network. 

In this case it w’as desired to map the extremely nonlinear curve 
E(A) produced by the fifty input amplitude histogram into a single 
ultimate strength output, S«. Given the appropriate number of 
neurons in the hidden layer, according to Kolmogorov's theorem 
(Caudill, 1988), a three-layer network exists that can perform this 
mapping exactly. For the problem at hand, the input dimension 
w'as m = 50, and the output dimension was n = 1; accordingly, a 
hidden layer of 2rn+l = 101 artificial neurons would be required to 
produce an exact mapping. Unfortunately, an exact mapping for a 
given amplitude distribution d oes not guarantee a good prediction 
capability for other amplitude distributions. 

Good prediction capability is obtained when the difference 
between the estimated amplitude distribution, efwjj, A), generated 
bv a multilayer network with sigmoid activated neurons, and the 
actual distribution, e(A), is minimized (Zurada, 1992). This error 
minimization is accomplished by making adjustments to the inter- 
connection weights Wij during supervised training using several 
representative input-output mappings or training pairs. Thus, an 
exact mapping is not desired — rather, a weighted approximation 
that minimizes the error over all the training pairs. 

Since the desired mapping was going to be approximate any- 
way, it was decided to simplify the problem further by introducing 
a second approximation. Grouping the amplitudes into the previ- 
ously discussed failure mechanism bands, the size of the hidden 
laver, and hence, the training time, was reduced considerably. 
Given that there were four failure mechanisms within the range of 
1-50 dB (Figure 1), a reasonable guess for the reduced dimension- 
ality of the problem was m = 4; whereupon, the number of neurons 
in the hidden layer necessary to exactly map the four humps 
became 2m+l = 9, which agreed quite well with the two neurons 
(perceptions) per hump rule suggested previously. The sigmoid- 
generated humps shown in Figure 8 represent an approximation to 
the normally distributed humps of Figure 1 . 

Prior to training the network, the interconnection weights were 
initialized to small random numbers between - 0.5 and + 0.5, and the 
amplitude and ultimate strength data were normalized to fit into 
the same range (Lawrence, 1991). Designating the total number of 
acoustic emission events in the amplitude distribution up to the 
proof load as Em**, the normalized events at any given amplitude, 
NE(A), were obtained from the equation 

(8) . NE(A) = [E(A)/Eoux] -0.5 

where E(A) was the number of AE events as a function of ampli- 
tude, A, from the amplitude distribution. The actual ultimate 
strengths, 5«, were normalized similarly: 

(9) NSg = {[Svj - (Su)i]/l(Su)u - (Su)*JI - 0.5 

with NSu the normalized ultimate strength, (SJi the low-er range ot 
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Figure 8 — Sigmoid-generaied failure mechanism hwnps. 

the expected ultimate strengths; and (SJ U the upper range of the 
expected ultimate strengths. 

Here Ema was chosen to be 250 (slightly greater than the maxi- 
mum number of events in any one of the eleven AE d a ta sets); while 
«' S- m h was set at 45 ksi (0.31 0 kPa); and (5 J u was set to 55 ksi (0.37 kPa). 
Al 1 of the above normalizations were effected to facilitate network 
training.* 

As .was mentioned previously, several training pairs (speci- 
mens) were needed to train the network. It was also necessary to 
have sufficient variety in the training inputs such that the network 
would be able to make valid generalizations for unfamiliar cases — 
a good distribution of possible inputs and outputs, plus any border 
'■.ases (Lawrence, 1 991 ). Therefore, the five specimens chosen for the 
aining set included the specimens with the highest ultimate 
strength, the lowest ultimate strength, and three intermediate 
values: 01-5, 01-8, 01-12, 01-13, and 01-14. 

Some general rules apply when developing the network archi- 
tecture for optimum performance (Bailey and Thompson, 1 990). T o 
improve accuracy on the training set, increase the size of the hidden 
;ayer(s). Alternately, to improve generalization capabilities and 
thus improve performance on new cases, reduce the size of the 
hidden layers). The optimal size for the hidden layer(s) is a balance 
between the objectives of accuracy and generalization for each 
application. 


Table 1 Summary of trie Neural Network Training 


Network 

Name 

Percentage 
Load Data 

Training 

Cycles 

RMS 

Training 

Error 

5PA05.NET 

100% 

733 

5% 

5P_A.03.NET . 

100% 

862 

3% 

BP_A_01.NET 

100% 

1,019 

1% 

BPB05.NET 

50% 

1 ,096 

5% 

BP_B_03.NET 

50% 

1 ,384 

3% 

BP_B_01.NET 

50% 

1,666 

1% 

BP.C_05.NET 

25% 

914 

5% 

BP_C.03.NET 

25% 

U49 

3% 

BP.C_01.NET 

25% 

1,613 ' 

1% 
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RESULTS 

The network architecture was optimized through trial and 
error. This resulted in an input layer of fifty neurons for the AE 
amplitude data, one hidden layer containing eight neurons for 
failure mechanism mapping, and an output layer comprised of a 
single neuron for ultimate strength. All the neurons were fully 
interconnected, making a total of 400 Interconnection weights 
between the input and the hidden layer, S between the hidden layer 
and the output, plus 9 biases. 

A summary of the neural network training is provided in Table 
]. Altogether, nine networks were trained using the normalized 
amplitude data and the normalized ultimate strengths from the five 
specimen training set. In each case the learning rate was set to 0.2 
and the momentum to 0.9. Three networks were trained using 1 00 
percent of the AE amplitude data from load inception to specimen 
failure, one to a 5 percent RMS training error, one to a 3 percent 
error, and one to a 1 percent error. This procedure was then 
repeated for three networks using the AE data up to 50 percent of 
the expected failure load and for three using the data up to 25 
percent of the expected failure load. 

Because the ultimate strength data were normalized prior to 
input, the output prediction values from the neural network were 
also in a normalized format As such, they had to be denormalized 
before any comparisons could be made. This was accomplished by 
inverting Equation 9: 

(10) <Su) F *d = (Su)i + [(SJy - (SJJKNSJprf + 0.5] 

with the pred subscript denoting the predicted value. 

Table 2 presents a summary of the actual versus predicted 
ultimate strengths and the associated errors for the three networks 
that were trained to a 3 percent RMS training error. (The 3 percent 
RMS training error seemed to optimize the prediction capability.) 
The AE data from all eleven specimens were applied to these three 
networks. This included the five specimens used as the training set 
and the six specimens from the test set Note that the prediction 
errors on the five training set specimens were very small, as 
expected, while the prediction errors on the six test set specimens 
were of the same order of magnitude but slightly larger. The worst 
case error for the 25 percent load d a ta was +2.6 percent for specimen 
01-15. 

Because the 25 percent load data contained the least amount of 
AE data, they were expected to yield the largest error, with the 50 
percent data having the next largest error, and the 1 00 percent data 
having little or no error. This, however, was not found to be the case. 
Instead, the resulting errors w’ere approximately the same for all 
three data sets. 

Acoustic emission due to gripping noise was experienced for 
the first few hundred pounds of load before each specimen became 
seated in the test machine grips. The amount of AE activity associ- 
ated with grip slip varied from test to test While it w*as anticipated 
that this extraneous data might prevent accurate predictions, the 
neural networks w*ere able to account for this effect and accurately 
predict ultimate strengths without removing it from the data set 

A few’ suggestions may be utilized for improving the prediction 
accuracy: (1) increase the accuracy of the input load data; (2) 
increase the number of specimens in the training set; and /or (3) a d d 
another AE parameter such as signal duration to allow better 
discrimination (Hill and Ely, 1992) between the various failure 
mechanisms. The other option is to employ probabilistic neural 
networks, or PNNs (Specht, 1990; Song and Schmerr, 1992). This 
would eliminate the need for a larger training set, because a good 
statistical sampling is all that is required, and it would speed up the 
training process, since only a single forward pass through the 
network is required. Also, because their activation functions are 
normal distribuations, PNNs might provide a more accurate fit to 
the normally distributed failure mechanism humps. 

Besides accuracy, the other area of concern was that all of the 
welds tested here were essentially good welds. While porosity w*as 
observed in some of the specimens (01 -5, 01 -7, and 01 -1 2), it w’as not 
enough to significantly affect their ultimate strengths. Moreover, 
there w r ere no cracks nor any inclusions, and no areas of lack of 




Table 2 Summary of the Actual vs. Predicted Ultimate Strengths of the AluminurrHjthium Wed Specimens 


Spec- 

Actual 

100% Load 

50% Load 


25% Load 

linen 

UltStr 

Data 


Data 


Data 


(ksi) 

Predicted 

% Error 

Predicted 7 

t Error 

Predicted 

% Error 



UtStrfUi]* 


UltStr(ksi)* 


Ult Stx [ksij 


01-5+ 

51.5 

51.0 

-0.97 

51.0 

-0.97 

51.0 

-0.97 

01-6 

51.4 

50.1 

-153 

50.0 

-2.72 

50.6 

-1.56 

01-7 

51.2 

S5.o 

-2.34 

49.9 

-2.54 

49.9 

-2.34 

01 -S+ 

51.0 

50.9 

-0.20 

50.9 

-0.20 

51.0 

0.00 

01-9 

50.8 

50.6 

-0.39 

50.4 

-0.79 

50.3 

-0.9S 

01-10 

50.S 

50.3 

-0.98 

50.3 

-0.98 

50.3 

40.95 

01-11 

50.6 

50.1 

-0.99 

50.3 

-0.59 

50.4 

-0.39 

01-12+ 

49.9 

50.0 

0.20 

50.0 

o 
» j 
o 

50.0 

0.20 

01-1 3-r 

49.1 

49.5 

0.S2 

49.5 

0.S2 

49.5 

0.S2 

01-14+ 

50.4 

50.5 

0.20 

50.5 

0.20 

50.4 

0.00 

01-15 

49.5 

50.6 

2.22 

50.3 

1.62 

50.S 

2.63 

+ Training set 

• (Note: ksi units x C.006S95 

= kPa) 





penetration or lack of fusion. What is needed for future testing is 
some truly defective weld specimen s. Training on both good and 
bad samples would then allow the network to predict the effect of 
defects on the ultimate strength of aluminum-lithium welds. 

If a quantitative measure of structural integrity were needed, 
this technique could be extended to predicting the ultimate in- 
service strength (burst pressure) for the external tank itself. Using 
a backpropagation network, at least three external tanks would 
have to be hydroproofed to failure, two of which would contain 
known defects; five or six external tanks would be optimal for 
training purposes. By utilizing a probabilistic neural network, the 
number of full-scale tanks taken to failure could probably be 
minimized to three. Once trained, the neural network would be 
able to predict burst pressures in future external tanks from the AE 
data taken at hydroproof pressures well below yield (Hill, 1992). 
The obvious drawback to this scheme is the prohibitive cost of 
taking three external tanks to failure. 

Since verification, not quantification, of structural integrity is 
normally the goal, a more cost-effective approach would be to use 
AE to monitor flaw growth activity during hydroproof (ASMS, 
1988; aSTM, 1987). Since growing flaws always emit and then- 
acoustic signatures are distinctly different from other mechanisms 
and very locatable, radiography could be applied to acoustically 
active flaw growth areas only. This w f ould provide a means of 
reducing the 100 percent x-ray inspection requirement currently 
imposed on the external tank welds (Nunes et al, 1984). Imple- 
menting such a procedure on the external tank would provide 
tremendous cost savings while maintaining the 1 00 percent inspec- 
tion requirement for man-rated vehicles. 

CONCLUSIONS 

Ultimate strengths can be predicted in aluminum-li thium w-elds 
using AE amplitude data taken at loads up to 25 percent of the 
expected ultimate strength. This prediction was accomplished 
through the use of a fully interconnected backpropagation neural 
network with a single hidden layer. The network automatically 
accounted for the AE activity associated with grip slip (through the 
interconnection weights) without having to remove this extrane- 
ous data a priori from the data set. It also seemed to adjust for the 
overlap in the failure mechanism amplitudes. All of this was 
accomplished with a relatively small training set of only five 
specimens. 

The fact that the prediction errors were essentially equal for the 
networks trained on the AE data set taken up to the ultimate load 
(100 percent load data), up to 50 percent of the ultimate load (50 
percent load data), and up to 25 percent of the ultimate load (25 


percent load data) meant that the same basic ultimate strength 
information w'as inherent in all three data sets. Thus, whatever AE 
parameters were keying the network prediction of ultimate strength 
were independent of any reduction in the AE data sets. This 
suggests the possibility of obtaining accurate ultimate strength 
predictions from the AE data at proof loads even lower than 25 
percent of the expected ultimate load. 

Finally, the +2.6 percent worst case ultimate strength prediction 
accuracy^at 25 percent load w*as very dose to the ±1.7 percent (± 
0.005 V) accuracy of the input load data. Since the network predic- 
tion accuracy cannot exceed the. accuracy of the input data, the 
network trained down to a 1 percent RMS error had larger predic- 
tion errors than either the networks trained to a 3 percent or a 5 
percent error. An increase in accuracy of the input load data from 
three to four significant figures may well improve the predictions 
to within ±1 percent (KaBoo, 1988). The prediction accuracy of the 
networks was probably also constrained by the limited size of the 
training set, only five samples (as opposed to eleven for Kalloo's 
w-ork). 
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